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Abstract 

Heritability estimates for body mass index (BMI) variation are high. For mothers and their offspring higher BMI correlations 
have been described than for fathers. Variation(s) in the exclusively maternally inherited mitochondrial DNA (mtDNA) might 
contribute to this parental effect. Thirty-two to 40 mtDNA single nucleotide polymorphisms (SNPs) were available from 
genome-wide association study SNP arrays (Affymetrix 6.0). For discovery, we analyzed association in a case-control (CC) 
sample of 1,158 extremely obese children and adolescents and 435 lean adult controls. For independent confirmation, 7,014 
population-based adults were analyzed as CC sample of n = 1,697 obese cases (BMI>30 kg/m^) and n = 2,373 normal weight 
and lean controls {Bl\/ll<25 kg/m^). SNPs were analyzed as single SNPs and haplogroups determined by HaploGrep. Fisher's 
two-sided exact test was used for association testing. IVloreover, the D-loop was re-sequenced (Sanger) in 192 extremely 
obese children and adolescents and 192 lean adult controls. Association testing of detected variants was performed using 
Fisher's two-sided exact test. For discovery, nominal association with obesity was found for the frequent allele G of 
m.8994G/A {rs28358887, p = 0.002) located in ATP6. Haplogroup W was nominally overrepresented in the controls 
(p = 0.039). These findings could not be confirmed independently. For two of the 252 identified D-loop variants nominal 
association was detected {m.l6292C/T, p = 0.007, m.l6189T/C, p = 0.048). Only eight controls carried the m.l6292T allele, 
five of whom belonged to haplogroup W that was initially enriched among these controls. m.l6189T/C might create an 
uninterrupted poly-C tract located near a regulatory element involved in replication of mtDNA. Though follow-up of some 
D-loop variants still is conceivable, our hypothesis of a contribution of variation in the exclusively maternally inherited 
mtDNA to the observed larger correlations for BMI between mothers and their offspring could not be substantiated by the 
findings of the present study. 
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Introduction 

Twin, adoption and family studies have shown that the 
heritaMity of the variation in body mass index (BMI) is high [1- 
4]. Genetic factors explain about 40% to 70% of the variance of 
the BMI [5]. Some family and adoption studies showed higher 
correlations in BMI between mothers and their offspring 
compared to fathers and their offspring e.g. [4, 6, 7]. In a study 
of 540 adult Danish adoptees, for instance, BMI correlation 
between biological mothers and offspring were 0.15 compared 
with 0.11 between biological fathers and offspring [4]. Among 
maternal half-brothers this effect was also shown as depicted by a 
two-fold higher BMI correlation (r = 0.21) compared to paternal 
half-brothers (r^O.ll) [8]. 

During the last years, several monogenic and polygenic forms of 
obesity have been elucidated (summarized in [5]). Large-scale 
genome-wide association studies (GWAS) and independent 
confirmation (up to 250,000 individuals in total) revealed several 
BMI and/ or body weight associated loci. However, these studies 
have only focused on autosomal SNPs e.g. [9-11]. 

The circular mitochondrial DNA (mtDNA) comprises 
16,569 bps. Somatic cells usually harbor about 1,000 to 10,000 
mtDNA molecules [12]. mtDNA encodes 37 genes of which 13 are 
protein coding subunits of the oxidati\ e phosphorylation system 
(OXPHOS) [13]. In addition, mtDNA consists of a 1,100 bps non- 
coding control region known as the mitochondrial displacement 
(D)-loop [13]. The D-loop consists of two to three hypervariable 
parts [14,15]. Both transcription and rephcation are coordinated 
at the D-loop [12] (Table 1). 

As mtDNA is exclusively maternally inherited, variation in 
mtDNA might contribute to the above mentioned higher 
correlation in BMI between mothers and their offspring. Up to 
date, only two GWAS of mtDNA variants in association with BMI 
have been performed [16,17]. Yang et al. reported association of 
the mitochondrial haplogroup X with a lower BMI in a sample of 
2,286 unrelated adult Caucasians [16]. The finding was not 
confirmed in an independent sample [16]. A GWAS on both 
European-American and African-American case-control (CC) 
samples of obese and lean children did not reveal association of 
BMI with any mtDNA variant or with heteroplasmy [17]. 

In the current analysis, we analysed GWAS data of mtDNA 
variants in extreme early onset obesity by using a CC sample of 
1,158 extremely obese children and adolescents and 435 lean adult 
controls of German descent. Our findings were foUowed-up in 
7,014 German population-based adults. Moreover, in a sub- 
sample of 384 individuals of the initial CC sample, the D-loop was 
re-sequenced in order to detect further variants potentially 
associated with obesity. 

Subjects and Methods 

Study subjects 

Ethics statement. Written informed consent was given by all 
participants. The study was approved by the Ethics Committees of 
the Universities of Marburg, Essen, Greifswald and Kiel, and the 
Bavarian Medical Association. It was conducted in accordance 
with the Declaration of Helsinki. 

Discovery GWAS sample. The CC GWAS sample consisted 
of 1,158 (extremely) obese children and adolescents and 435 
normal weight or lean adult controls. The controls and 453 cases 
were derived from a CC GWAS sample [18]. The additional 705 
cases were index cases from a family-based GWAS sample of 705 
obesity trios (one (extremely) obese child or adolescent [== index 
case] and both biological parents) [18]. Lean adults as controls 



who were never overweight or obese during childhood (as assessed 
by interview) were used as this was thought to reduce the chances 
of misclassification compared with the use of lean children as 
controls who might become overweight or obese in adulthood 
[19]. The measured body mass index (BMI; in kg/m^) was 
assessed for extremeness using age- and sex-specific percentile 
criteria for the German population from the National Nutrition 
Survey I [20]. According to this reference population, 84% of all 
cases were extremely obese (BMI >99* percentile). The lean 
controls had a mean BMI of 18.31 ±1.11 kg/m^ (Table SI). 

Confirmation GWAS sample. For confirmation of initial 
findings, three population-based adult GWAS samples were used 
(n= 7,014, Table SI). (1) KORA: this sample is a sub-sample of 
KORA F4, which is an epidemiological study group of the region 
of Augsburg (Cooperative Health Research in the Region of 
Augsburg) [21] and comprised 1,743 adult participants (890 
females). (2) SHIP: "The Study of Health in Pomerania" is a 
population-based project in Northeast Germany comprising 4,308 
individuals aged 20 to 79 years at recruitment. Of these, 4,073 
individuals (2,067 females) were genotyped with the Aflymetrix 
Genome-Wide Human SNP Array 6.0 and included in the 
analysis [22]. (3) POPGEN: the 1,198 individuals (524 females) of 
POPGEN (age: 19 to 77 years) genotyped with the Affymetrix 
Array 6.0 are from a population-genetic research project founded 
at the University Hospital of Schleswig-IIolstein [23]. 738 subjects 
(336 females) were recruited via the local population registry and 
460 (188 females) as blood donors. BMI of the individuals 
recruited via the local population registry was estimated by self- 
report, while BMI of the blood donors was measured. 

As simulations have shown that genetic markers with an effect in 
the extremes of a trait are detected more solidly within a CC 
design compared with a linear regression design [24], we 
converted the population-based samples into a CC sample 
categorizing all individuals with a BMI S:30 kg/m^ as obese cases 
(n= 1,697) and those with a BMI <25 kg/m^ as normal weight 
and lean controls (n = 2,373; Table SI). 

D-loop sample. The D-loop was re-sequenced in 192 
extremely obese cases and 192 lean controls. These individuals 
were derived from the initial CC GWAS sample apart from 14 
cases and six controls. Mean age and BMI were similar to those 
found in the initial CC GWAS sample (Table SI). 

Molecular genetic analysis 

Genotyping. AU individuals were genotyped by the Affyme- 
trix Genome- Wide Human SNP Array 6.0. This array covers 1 15 
to 119 mtDNA SNPs (Table S2). The following quality control 
(QC) criteria were applied: (1) SNP call-rate above 95%, (2) minor 
allele frequency (MAP) above 1%, and (3) cluster graphs checked 
independentiy by two raters for clear separation of both alleles. 
Only 32 to 40 SNPs passed these criteria as most of the mtDNA 
SNPs were monomorphic or had a very low MAF (Table S2). 

Variant detection by re-sequencing (Sanger) of 
mitochondrial D-loop. Primers were selected according to 
Cardoso et al. [25] (Fig. SI). Instead of H616 [25], H715 was 
chosen to include the sequence of the second heavy strand 
promoter (HSP2), whose start is located at m.644 [26,27], i.e. 
outside of the actual D-loop (m.576 to m.l6024, [28]) (Fig. SI). 
The outer primers L15988 and H715 were used for amplification 
of the D-loop (primer annealing temperature: 65°C). 

Re-sequencing was performed by LGC Genomics Berlin, Germany 
using all four primers. Received electro-pherograms were evalu- 
ated manually using Seqman Pro (v.10.1.0 (174), 419, DNASTAR, 
Inc., Madison (WI), USA). The D-loop sequence of the revised 
Cambridge Reference Sequence (rCRS, [29]) was copied into a 
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Microsoft Office Excel 2007 (Microsoft Coop., Redmond (WA)) sheet, so 
that each cell comprised ten nucleotides and each line 60 
nucleotides. Deviations from this reference were noted below the 
reference in a separate Excel sheet for each individual. Evaluation 
of electropherograms was performed by two independent raters. 
Discrepancies were resolved unambiguously by either reaching 
consensus or re-sequencing. 

Variant detection by re-sequencing (Sanger) of complete 
mitochondrial DNA. Complete mtDNA was re-sequenced in 
five cases and five controls of the discovery GWAS sample by 
Seqlab Gottingen, Germany. Evaluation of received electropherograms 
was performed manually as described above. 

Determination of haplogroups 

The haplogroup of each individual of the discovery and 
confirmation sample was determined with HaploGrep [30,31]. 
For this analysis, HaploGrep software provided at http:// 
haplogrep.uibk.ac. at/was downloaded. AU available mtDNA 
SNPs from each individual (32 to 40 mtDNA SNPs, Table 82/ 
Table S3) were entered. Each individual's haplogroup was 
determined based on PhyloTree.org (mtDNA tree build 11, [31]) 
which is implemented in the software. Only those individuals 
whose haplogroup quality value was rated >90% were included 
into statistical analyses. At this quality threshold haplogroup 
assignment is quite reliable according to HaploGrep's manual. For 
association testing, haplogroups were assigned to major hap- 
logroups. 

Statistic tests 

For discovery, association testing was performed using Fisher's 
two-sided exact test for both single SNP and haplogroup analysis. 
Analyses were also performed stratified by gender. Odds ratios and 
confidence inter\'als were determined. For independent confirma- 
tion, nominally associated SNPs or haplogroups were foUowed-up 
in the confirmation sample - in the whole sample or stratified by 
gender depending on the discovery finding. 

Frequencies of detected variants by D-loop rc-scquencing were 
compared between cases and controls using Fisher's two-sided 
exact test. Moreover, average number of variants in 23 defined 
functionally relevant regions of the D-loop (adapted from www. 
mitomap.org; Table 1, [32]) was compared between cases and 
controls with a t-test. 

Results 

Single SNP analysis 

Association analysis was performed with 35 mtDNA SNPs in 
the whole discovery GWAS sample of 1,158 (extremely) obese 
children and adolescents and 435 lean and normal weight adult 
controls. Five further SNPs were analyzed in this sample excluding 
the 705 cases from the family-based trio GWAS sample, as these 
five SNPs did not pass QC in this sample (Table S3). Nominal 
association was found for m.8994G/A (p = 0.002), whose minor 
allele A was more frequent among the controls (3.92% vs. 1.30%; 
Table 2). m.8994 is located in the ATP6 gene. The G/A transition 
at m.8994 is synonymous. Stratified by gender, nominal associ- 
ation was found for both male and female subjects (Table 2). In 
female subjects, nominal association was found for three further 
SNPs, while in male subjects one further SNP was nominally 
associated (Table 2). 

Follow-up of m.8994G/A in a sample of 1,697 obese cases and 
2,373 normal weight controls from three adult population-based 
GWAS samples did not lead to an independent confirmation of 
the initial finding. SNPs initially associated in only female or male 



subjects could neither be confirmed. In addition, direction of effect 
for most of these SNPs was different between discovery and 
confirmation (Table 2). 

Haplogroup analysis 

Using HaploGrep [30] based on phylotree build 11 [31], we 
identified 80 haplogroups with a quality value ^90%. This quality 
threshold was reached by 96% of all study subjects from both 
discovery and confirmation. Most of the identified haplogroups 
(94% to 95% depending on sample) could be assigned to eight 
European major haplogroups (H, U, T, V, J, K, W, X). 

For discovery analysis, we detected nominal association with 
obesity for haplogroup W (p = 0.034; Table 3, Table S4), which 
was - just as the minor allele A of m.8994 - more frequent among 
the lean adult controls. All individuals of haplogroup W were 
minor allele carriers of m.8994. Stratified by gender, nominal 
association of haplogroup W remained only for male subjects 
(p = 0.012). Among the females we detected nominal association 
for haplogroup J (p = 0.032). Compared to die single SNP GWAS 
findings, none of the initially associated haplogroups could be 
confirmed in the independent sample, and directions of effect were 
opposite (Table 3). 

Re-sequencing of mtDNA D-loop 

We re-sequenced the D-loop in 192 extremely obese cases and 
192 lean controls, which were derived from the initial discovery 
sample apart from 20 individuals. This was done, because prior 
complete re-sequencing of mtDNA of 10 individuals (eight of these 
had haplogroup W) revealed that among the individuals of 
haplogroup W inter-individual variability in the D-loop (Table S5) 
was larger compared with the coding region (Table S6). Moreover, 
the coverage of the D-loop by only one SNP on the SNP array 
(Table S3) was insufficient for association analysis for D-loop 
variants. Finally, the D-loop is an important control region 
pertaining to transcription and replication of mtDNA, and 
variation in this region might have an impact on these processes 
and interfere with body weight. 

We excluded one case and one control each from the 384 
individuals whose D-loop was re-sequenced from further analysis, 
as we detected 9 and 4 clearly visible point heteroplasmies. Point 
heteroplasmies are usually rather infrequent especially in blood 
cells [33,34], from which DNA was extracted, and might be an 
indication for contamination with foreign DNA [35]. 

In the remaining 382 individuals we detected a total of 252 
deviations from the rCRS [29], four of which were not located in 
the actual D-loop (i.e. between m.576 and m. 16024, Table S7/ 
Table S8). Of these, 223 were single nucleotide exchanges at 213 
positions, as at 10 positions tri-aUehc exchanges were present, 
three were complex nucleotide exchanges (i.e. a combination of a 
single nucleotide exchange and an insertion as for instance 
m. 16183A/CC), 18 were insertions and eight were deletions. A 
major part of the detected variants have been already described 
(www.mitomap.org, last edited on Apr 23, 2013, Table S7/Table 
S8) [32]. 

Each one point heteroplasmy was detected in four cases and 
nine controls. Moreover, we detected length heteroplasmies at four 
pre\'i()usly known k'ngth heteroplasmic mtDNA regions (Fig. 1). 

The average number of variants per individual was 8.3. This 
frequency did not differ between cases and controls (p = 0.989). 
We also compared the frequencies of each of the 252 detected 
variants between cases and controls, and found m.l6292C/T 
(p = 0.007) and m.l6189T/C (p = 0.048) to be nominaUy associ- 
ated with obesity, m. 16292C/T was present only in eight controls, 
of which five had haplogroup W, whose frequency was nominally 
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higher in the initial CC GWAS sample and tended to be higher 
among the controls of the D-loop sample (p = 0.062). By contrast, 
m. 16189T/C had a higher frequency among the cases (17% vs. 
9%). 

A transition at m. 16189 might create an uninterrupted poly-C 
tract of 10 Cs (Fig. 1), in case no further transition has occurred 
between m.l6184 and m.l6193. Among the cases, 15% had such 
an uninterrupted poly-C tract, while only 10% among the controls 
(p = 0.116). 

We detected no differences when comparing average number of 
variants per individual between cases and controls for the 23 
functionally relevant D-loop locations (Table 1). 

Discussion 

Due to its maternal inheritance, we addressed the question if 
variation in mtDNA might contribute to the observed larger 
correlation for BMI between mothers and their offspring than 
between fathers and their offspring [4,6,7] or maternal versus 
paternal half-brothers [8] . Therefore, we performed an association 
study for mtDNA SNPs predominantly from the mtDNA coding 
region in extremely obese children and adolescents versus lean 
adult controls and re-sequenced the D-loop in a sub-sample of this 
CC study group. 

For the coding region variants, no single variant or haplogroup 
was robustly associated with obesity. The results are in accordance 
with a previous study [17]. Yang et al. showed association of 
haplogroup X with lower BMI in a sample of 2,286 unrelated 
adult Caucasians; however, these data were not confirmed in an 
independent sample [16]. In our study, haplogroup X was not 
associated with BMI/leanness in the young or the adult sample. 
Nevertheless, in the adults the direction of effect was the same as 
described previously (Table 3; [16]). 

We considered that our initial findings were spurious, as the 
direction of effect of each haplogroup and of all but one SNP 
differed between discovery and confirmation (Table 2). However, 
an additional explanation might be that different mtDNA SNPs 
and/or haplogroups might be relevant for children and adoles- 
cents than for adults pertaining to BMI [18]. This could explain 
why we were not able to replicate association of haplogroup X 
[16] with low BMI in the discovery (p = 1.0), while a minor trend 
(p = 0.12, consistence of direction of risk allele) was found in the 
adults (confirmation). Obviously, based on the sample sizes we 
cannot exclude the existence of associations of small effect sizes. 

Haplogroup assignment was performed with HaploGrep [30] 
based on Phylotree built 11 [31] with the genotype information of 
the up to 40 mtDNA SNPs from the SNP array. Bandelt et al. 
rated HaploGrep to be more sophisticated than the other 
programs which are able to assign haplogroups automatically 
Q [36]. All major European haplogroups - except haplogroup I - 

^ S were found, and their frequencies were in accordance with those 



expected among West Europeans (www.mitomap.org, [32]) or 

§ B Si g Germans ([37]; Table S9). Individuals could not be assigned to 

.^11 g haplogroup I, as SNPs at m. 10034, m. 16 129 and m. 16391 leading 

2! o ^ to haplogroup I or any variant which would have led to a sub- 

.2 £ -g o haplogroup of haplogroup I were absent from the SNP array 

X 5 o c (www.phylotree.org, built 11; Fig. S2; [31]). Haplogroup I 

:§ I S branches-off of Nle'l (www.phylotree.org, built 11; Fig. S2; 

^ jj u £ [31]). In West Europeans and Germans, Haplogroup N occurs 
S S tj o 

"D "a TO ^ 



with a frequency of 1% (www.mitomap.org, [32]) and 0.6% [37], 
respectively, but among the individuals of the current study 
samples, the frequency was 2.5 to 3.5% (Table S9). Thus, 
§ g i£ ;5 individuals actually belonging to haplogroup I might have 

"remained" in haplogroup Nl (Fig. S2). 
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Figure 1 . Detected length heteroplasmies. Detection of length heteroplasmles, I.e. mixtures of various lengths of a certain mtDNA region In one 
Individual, occurred at four locations in the D-loop (m.15024 to m.576), predominantly at poly-C tracts. Numbering according to rCRS [29]. 
doi:1 0.1 371/journal.pone.0094882.g001 



Haplogroup association testing was restrained to major 
haplogroups, because of the limited numl)C'r of SNPs on the 
SNP array which disabled a refined haplogroup determination for 
some individuals. Moreover, given a major haplogroup is present 
at a low percentage in a population as for instance haplogroup W 
in the present study samples (~2%, Table S9), refined haplogroup 
association testing would have to be done in a sample with a much 
larger sample size as those of the present study samples. 
Nevertheless, variants biologically relevant for obesity might be 
found in the sub-haplogroups and hence these variants or sub- 
haplogroups might have been masked by association testing of 
only the respective major haplogroup. 

Pertaining to the D-loop variants, two (m. 16292C/T and 
m.l6189T/C) of the 252 detected variants were nominally 
associated with obesity among the 191 cases and 191 controls of 
the D-loop sample, m. 16292C/T was only found in eight controls 
of which five had haplogroup W that was initially overrepresented 
among the controls of the CC GWAS sample (p = 0.048; D-loop 
sample: p = 0.062). As haplogroup W could not be confirmed 
independently in the present study, a follow-up of this variant does 
not seem useful. 

m.I6189T/C is located with a poly-C tract between m.l6184 
and m. 16193. This variant as well as m.l6189T/CC and 
m. 1 6 1 89delT led to an uninterrupted poly-C tract given no other 
transition or insertion except of C has occurred between m. 16184 
and m. 16 193 (Fig. 1). In the present study, frequency of the 
uninterrupted poly-C tract tended to be higher among the cases. 
Parker et al., by contrast, reported the uninterrupted poly-C tract 
to be associated with leanness among 161 Australian mothers and 
their 20-year-old offspring [38]. 

Moreover, all individuals of the present D-loop sample with an 
uninterrupted poly-C tract showed length heteroplasmy at this 
tract. This might be the result of strand slippage during the 
replication process generating tracts of variable length of 
predominantly 10 to 12 Cs [39,40]. Moreover, length patterns 
were different between maternal lineages but nearly identical 
within a maternal lineage [39]. In addition, the termination 
associated element (TAS, m.l6157 to m.l6172, Table 1), which is 
involved in premature determination of the H-strand synthesis in 
order to create the triple stranded D-loop [28,41,42] is located 
near this C tract. As binding of proteins at the TAS element was 
shown, binding capacity and thus mtDNA transcription and 
replication might be influenced by the nearby uninterrupted C 
tract [43]. Among 837 healthy adult Taiwanese (mean 
BMI = 24.5 kg/m^), the lowest mtDNA content was found among 
individuals with an uninterrupted C tract compared with 
individuals of the wild-type or an otherwise interrupted C tracts 
[44]. Nevertheless, mean BMI between the three groups was 



similar [44]. Further investigation is needed whether 1) the 
detected o\-crr(;j)rcseiitation by trend of the uninterrupted C tract 
in the ()l)esc can be confirmed in an independent sample and 2) 
the uninterrupted C-tract has an influence on mtDNA levels and 
BMI among Europeans. 

Finally, as some D-loop variants only occurred at very low 
frequencies (in only one or two individuals), a refined association 
analysis of these variants potentially in combination with the 23 
functionally relevant regions in a larger study sample could be 
subject of further investigation. 

All in all, though foUow-up of some D-loop variants stiU is 
conceivable, our hypothesis of a contribution of variation in the 
exclusively maternally inherited mtDNA to the observed greater 
correlations in BMI between mothers and their offspring than 
between fathers and their offspring could not be substantiated by 
the findings of the present study. 
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